Search CORE

164 research outputs found

MAPU 2.0: high-accuracy proteomes mapped to genomes

Author: Anthonypillai
Ashburner
Craig
Craig
Desiere
Deutsch
Dowell
E. Birney
F. Gnad
Foster
Gnad
Jones
Kersey
M. Mann
M. Oroshi
Olsen
Ong
Publication venue: Oxford University Press
Publication date: 01/01/2009
Field of study

The MAPU 2.0 database contains proteomes of organelles, tissues and cell types measured by mass spectrometry (MS)-based proteomics. In contrast to other databases it is meant to contain a limited number of experiments and only those with very high-resolution and -accuracy data. MAPU 2.0 displays the proteomes of organelles, tissues and body fluids or conversely displays the occurrence of proteins of interest in all these proteomes. The new release addresses MS-specific problems including ambiguous peptide-to-protein assignments and it provides insight into general functional features on the protein level ranging from gene ontology classification to comprehensive SwissProt annotation. Moreover, the derived proteomic data are used to annotate the genomes using Distributed Annotation Service (DAS) via EnsEMBL services. MAPU 2.0 is a model for a database specifically designed for high-accuracy proteomics and a member of the ProteomExchange Consortium. It is available on line at http://www.mapuproteome.com

Crossref

PubMed Central

From DNA sequence to application: possibilities and complications

Author: A Bruttin
A Bruttin
A Kraus
A Nauta
A Nauta
A Nauta
A Sazicu
AK Aggarwal
B Christiansen
B Christiansen
BC Lokman
BC Lokman
BM Chassy
C Delorme
C Hill
C Schouler
CA Alpert
CA Shearman
CJ Hueck
CO Pabo
CT Verrips
D Botstein
D Lillehaug
D Lillehaug
D Sinderen
DJ O’Sullivan
DJ O’Sullivan
DO Gostick
DO Gostick
E Küster
E Maguin
E Stanley
EJ Luesink
EJ Luesink
EJ Luesink
F Desiere
F Desiere
G Buist
G Buist
G Ramsay
G Vriend
GM Djordjevic
GM Djordjevic
H Brüssow
H Holo
H Neve
I Biswas
J Bardowski
J Bernhardt
J Green
J Law
J Payne
J Stülke
JB Luchansky
JD Boyce
JW Sanders
JW Sanders
JW Sanders
K Chung
K Leenhouts
K Leenhouts
K Leenhouts
K Schnetz
KI Kodaira
KJ Leenhouts
L Dupont
L Dupont
LJ Beamer
LR Garcia
M Curie
M de Guchte
M de Guchte
M Lieb
M Ptashne
MG Johnsen
MJ Gasson
MJ Weickert
MM Sheehan
MW Lubbers
MW Lubbers
N Goupil-Feuillerat
OP Kuipers
P Hals
PG Ruyter
PGGA Ruyter
PL Madsen
PS Chandry
R Parreira
R Schmid
R Young
R Young
RR Raya
S Challou
S Lucchini
S Lucchini
S Simonen
S Spiro
S Spiro
SA Walker
SB Mclvilc
SG Kim
SM Heller
SM Madsen
SR Swindell
T Sasaki
TM Ramseier
V Vagner
WM Vos
WM Vos
WM Vos
Y Fujita
Publication venue
Publication date: 01/01/1999
Field of study

The development of sophisticated genetic tools during the past 15 years have facilitated a tremendous increase of fundamental and application-oriented knowledge of lactic acid bacteria (LAB) and their bacteriophages. This knowledge relates both to the assignments of open reading frames (ORF’s) and the function of non-coding DNA sequences. Comparison of the complete nucleotide sequences of several LAB bacteriophages has revealed that their chromosomes have a fixed, modular structure, each module having a set of genes involved in a specific phase of the bacteriophage life cycle. LAB bacteriophage genes and DNA sequences have been used for the construction of temperature-inducible gene expression systems, gene-integration systems, and bacteriophage defence systems. The function of several LAB open reading frames and transcriptional units have been identified and characterized in detail. Many of these could find practical applications, such as induced lysis of LAB to enhance cheese ripening and re-routing of carbon fluxes for the production of a specific amino acid enantiomer. More knowledge has also become available concerning the function and structure of non-coding DNA positioned at or in the vicinity of promoters. In several cases the mRNA produced from this DNA contains a transcriptional terminator-antiterminator pair, in which the antiterminator can be stabilized either by uncharged tRNA or by interaction with a regulatory protein, thus preventing formation of the terminator so that mRNA elongation can proceed. Evidence has accumulated showing that also in LAB carbon catabolite repression in LAB is mediated by specific DNA elements in the vicinity of promoters governing the transcription of catabolic operons. Although some biological barriers have yet to be solved, the vast body of scientific information presently available allows the construction of tailor-made genetically modified LAB. Today, it appears that societal constraints rather than biological hurdles impede the use of genetically modified LAB.

Crossref

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

University of Groningen Digital Archive

Dissertations of the University of Groningen

PRIDE: new developments and new datasets

Author: A. F. Quinn
Anthonypillai
Ashburner
Bantscheff
Bard
Cote
Cote
Craig
D. Thorneycroft
Desiere
Durinck
H. Hermjakob
Hamacher
Jones
Joshi-Tope
L. Martens
Lo
Martens
Mead
P. Jones
Pan
Peri
Phan
Prince
R. G. Cote
S. Klie
S. Y. Cho
Schomburg
Siepen
Zheng
Publication venue: Oxford University Press
Publication date: 01/01/2008
Field of study

The PRIDE (http://www.ebi.ac.uk/pride) database of protein and peptide identifications was previously described in the NAR Database Special Edition in 2006. Since this publication, the volume of public data in the PRIDE relational database has increased by more than an order of magnitude. Several significant public datasets have been added, including identifications and processed mass spectra generated by the HUPO Brain Proteome Project and the HUPO Liver Proteome Project. The PRIDE software development team has made several significant changes and additions to the user interface and tool set associated with PRIDE. The focus of these changes has been to facilitate the submission process and to improve the mechanisms by which PRIDE can be queried. The PRIDE team has developed a Microsoft Excel workbook that allows the required data to be collated in a series of relatively simple spreadsheets, with automatic generation of PRIDE XML at the end of the process. The ability to query PRIDE has been augmented by the addition of a BioMart interface allowing complex queries to be constructed. Collaboration with groups outside the EBI has been fruitful in extending PRIDE, including an approach to encode iTRAQ quantitative data in PRIDE XML

Crossref

Ghent University Academic Bibliography

PubMed Central

MPG.PuRe

Identification of ubiquitin/ubiquitin-like protein modification from tandem mass spectra with various PTMs

Author: AL Chernorudskiy
B MacLean
B Yan
BT Hansen
C Kang
Chiyong Kang
D Tsur
DC Chamrad
DM Creasy
E Ahrné
E Ahrné
ES Witze
F Desiere
Gwan-Su Yi
H Lee
J Rodriguez
JimmyK Eng
JR Yates
JR Yates
PGA Pedrioli
RK Meray
SM Jeram
T Srikumar
W Zhang
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

A DIGE study on the effects of salbutamol on the rat muscle proteome - an exemplar of best practice for data sharing in proteomics

Author: Andrew R Jones
AR Jones
C Hoogland
C Hoogland
CF Taylor
CF Taylor
DJ Slotta
DW Huang
F Desiere
F Gibson
F Gibson
H Barsnes
J Alberto Medina-Aunon
JA Medina-Aunon
Jenna Kenyani
Jonathan M Wastling
Juan-Pablo Albar
M Ashburner
M Unlu
P Jones
PA Binz
PH O'Farrell
PJ Domann
R Craig
S Martínez-Bartolomé
S Orchard
Salvador Martinez-Bartolomé
UK Laemmli
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

BACKGROUND: Proteomic techniques allow researchers to perform detailed analyses of cellular states and many studies are published each year, which highlight large numbers of proteins quantified in different samples. However, currently few data sets make it into public databases with sufficient metadata to allow other groups to verify findings, perform data mining or integrate different data sets. The Proteomics Standards Initiative has released a series of "Minimum Information About a Proteomics Experiment" guideline documents (MIAPE modules) and accompanying data exchange formats. This article focuses on proteomic studies based on gel electrophoresis and demonstrates how the corresponding MIAPE modules can be fulfilled and data deposited in public databases, using a new experimental data set as an example. FINDINGS: We have performed a study of the effects of an anabolic agent (salbutamol) at two different time points on the protein complement of rat skeletal muscle cells, quantified by difference gel electrophoresis. In the DIGE study, a total of 31 non-redundant proteins were identified as being potentially modulated at 24 h post treatment and 110 non redundant proteins at 96 h post-treatment. Several categories of function have been highlighted as strongly enriched, providing candidate proteins for further study. We also use the study as an example of best practice for data deposition. CONCLUSIONS: We have deposited all data sets from this study in public databases for further analysis by the community. We also describe more generally how gel-based protein identification data sets can now be deposited in the PRoteomics IDEntifications database (PRIDE), using a new software tool, the PRIDESpotMapper, which we developed to work in conjunction with the PRIDE Converter application. We also demonstrate how the ProteoRed MIAPE generator tool can be used to create and share a complete and compliant set of MIAPE reports for this experiment and others

Keele Research Repository

Crossref

Springer - Publisher Connector

PubMed Central

The Drosophila melanogaster PeptideAtlas facilitates the use of peptide data for improved fly proteomics and genome annotation

Abstract Background Crucial foundations of any quantitative systems biology experiment are correct genome and proteome annotations. Protein databases compiled from high quality empirical protein identifications that are in turn based on correct gene models increase the correctness, sensitivity, and quantitative accuracy of systems biology genome-scale experiments. Results In this manuscript, we present the <it>Drosophila melanogaster </it>PeptideAtlas, a fly proteomics and genomics resource of unsurpassed depth. Based on peptide mass spectrometry data collected in our laboratory the portal <url>http://www.drosophila-peptideatlas.org</url> allows querying fly protein data observed with respect to gene model confirmation and splice site verification as well as for the identification of proteotypic peptides suited for targeted proteomics studies. Additionally, the database provides consensus mass spectra for observed peptides along with qualitative and quantitative information about the number of observations of a particular peptide and the sample(s) in which it was observed. Conclusion PeptideAtlas is an open access database for the <it>Drosophila </it>community that has several features and applications that support (1) reduction of the complexity inherently associated with performing targeted proteomic studies, (2) designing and accelerating shotgun proteomics experiments, (3) confirming or questioning gene models, and (4) adjusting gene models such that they are in line with observed <it>Drosophila </it>peptides. While the database consists of proteomic data it is not required that the user is a proteomics expert.</p

Repository for Publications and Research Data

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The utility of mass spectrometry-based proteomic data for validation of novel alternative splice forms reconstructed from RNA-Seq data: a preliminary assessment

Author: A Mortazavi
AI Nesvizhskii
AI Nesvizhskii
AI Nesvizhskii
AI Nesvizhskii
Alexey I Nesvizhskii
B MacLean
B Zybailov
C Trapnell
D Fermin
D Karolchik
D Maglott
DJ Pagliarini
E Melamud
ET Wang
F Birzele
F Desiere
F Mo
J Cox
JC Castle
JS Aaronson
K Ning
KA Power
Kang Ning
L Wu
MS Boguski
NJ Edwards
PJ Kersey
R Aebersold
R Craig
RJ Slebos
SF Altschul
WM Old
Z Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

mspecLINE: bridging knowledge of human disease with the proteome

Author: AM Cohen
B Ye
BJ Stapley
BT Alako
C Bennett
CC van der Eijk
DJ Slotta
E Keogh
Eric W Deutsch
EW Deutsch
F Desiere
H Liao
H Liu
HJ Lowe
J Boyle
J Saltz
Jeremy Handcock
John Boyle
M Li
M Li
M Li
MY Brusniak
P Khatri
P Mallick
P Picotti
P Shannon
PA Covitz
R Cilibrasi
R Cilibrasi
R Homayouni
RL Cilibrasi
S Deerwester
V Lange
Y Tsuruoka
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Public proteomics databases such as PeptideAtlas contain peptides and proteins identified in mass spectrometry experiments. However, these databases lack information about human disease for researchers studying disease-related proteins. We have developed mspecLINE, a tool that combines knowledge about human disease in MEDLINE with empirical data about the detectable human proteome in PeptideAtlas. mspecLINE associates diseases with proteins by calculating the semantic distance between annotated terms from a controlled biomedical vocabulary. We used an established semantic distance measure that is based on the co-occurrence of disease and protein terms in the MEDLINE bibliographic database. Results The mspecLINE web application allows researchers to explore relationships between human diseases and parts of the proteome that are detectable using a mass spectrometer. Given a disease, the tool will display proteins and peptides from PeptideAtlas that may be associated with the disease. It will also display relevant literature from MEDLINE. Furthermore, mspecLINE allows researchers to select proteotypic peptides for specific protein targets in a mass spectrometry assay. Conclusions Although mspecLINE applies an information retrieval technique to the MEDLINE database, it is distinct from previous MEDLINE query tools in that it combines the knowledge expressed in scientific literature with empirical proteomics data. The tool provides valuable information about candidate protein targets to researchers studying human disease and is freely available on a public web server.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Tandem mass spectrometry data quality assessment by self-convolution

Author: A Shevchenko
AA Bharath
AL McCormack
AL McCormack
Andrew Keller
Bin Ma
BJ Cargile
C Yu
CG Herbert
D Fenyo
DC Barbacci
DL Tabb
DN Perkins
F Desiere
HI Field
JE Elias
JE Syka
Jimmy K Eng
JK Eng
JV Puymbrouck
K Biemann
K Biemann
Keng Wah Choo
KR Clauser
LY Geer
M Kinter
M Mann
Marshall Bern
N Zhang
P Roepstorff
PA Pevzner
Purvine Samuel
RA Zubarev
Randy J Arnold
Richard S Johnson
RS Johnson
S Sunyaev
Salmi Jussi
VH Wysocki
Wai Mun Tham
Wu Fang-Xiang
Wu Yik-Chung
Z Zhang
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Many algorithms have been developed for deciphering the tandem mass spectrometry (MS) data sets. They can be essentially clustered into two classes. The first performs searches on theoretical mass spectrum database, while the second based itself on <it>de novo </it>sequencing from raw mass spectrometry data. It was noted that the quality of mass spectra affects significantly the protein identification processes in both instances. This prompted the authors to explore ways to measure the quality of MS data sets before subjecting them to the protein identification algorithms, thus allowing for more meaningful searches and increased confidence level of proteins identified. Results The proposed method measures the qualities of MS data sets based on the symmetric property of b- and y-ion peaks present in a MS spectrum. Self-convolution on MS data and its time-reversal copy was employed. Due to the symmetric nature of b-ions and y-ions peaks, the self-convolution result of a good spectrum would produce a highest mid point intensity peak. To reduce processing time, self-convolution was achieved using Fast Fourier Transform and its inverse transform, followed by the removal of the "DC" (Direct Current) component and the normalisation of the data set. The quality score was defined as the ratio of the intensity at the mid point to the remaining peaks of the convolution result. The method was validated using both theoretical mass spectra, with various permutations, and several real MS data sets. The results were encouraging, revealing a high percentage of positive prediction rates for spectra with good quality scores. Conclusion We have demonstrated in this work a method for determining the quality of tandem MS data set. By pre-determining the quality of tandem MS data before subjecting them to protein identification algorithms, spurious protein predictions due to poor tandem MS data are avoided, giving scientists greater confidence in the predicted results. We conclude that the algorithm performs well and could potentially be used as a pre-processing for all mass spectrometry based protein identification tools.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Genomic structure and insertion sites of Helicobacter pylori prophages from various geographical origins

Author: A Covacci
A Kalia
A Untergasser
AE Darling
B Bjorkholm
BJ Marshall
C Canchaya
D Falush
D Falush
D Falush
D Kersulyte
D Kersulyte
D Kersulyte
D Kersulyte
DA Baltrus
DR Zerbino
EN Schmid
F Corpet
F Desiere
F Golais
FF Vale
FF Vale
FF Vale
G Morelli
H Brussow
HE Heintschel von
I Kobayashi
I Milne
I Vitoriano
J Uchiyama
J Uchiyama
JA Gama
JK Pritchard
JM Thiberge
JP Gomes
K Katoh
K Tamura
K Yahara
K Zhou
LC Fortier
LM Bobay
LM Bobay
M Kimura
M Oleastro
MF Go
P Olbermann
PZ Kozbial
R Feiner
R Grande
R Grande
RA Alm
RK Aziz
S Censini
S Goh
S Kuno
SF Altschul
T Ooka
W Bao
WH Pope
X Fan
X Wang
X Wang
Y Furuta
Y Gelfand
Y You
Y Zhou
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

We present the full genomic sequences, insertion sites and phylogenetic analysis of 28 prophages found in H. pylori isolates from patients of distinct disease types, ranging from gastritis to gastric cancer, and geographic origins, covering most continents. The gentic diversity of H pylori is known to be influenced by these genomic elements including prophages who’s geneomes range from 22.6 to 33.0 Kbp. There was a high conservation of integration site shared in over 50% of cases with greater than 40% or prophage genomes harbouring insertion sequences (IS). Furthermore prophage genomes present a robust phylogeographic pattern, revealing four distinct clusters: one African, one Asian and two European prophage populations. There was evidence of recombination within the genome of some prophages, which resulted in genome mosaics composed by different populations, which may yield additional H. pylori phenotypes

Crossref

PubMed Central

Cronfa at Swansea University

UM Digital Repository

Repositório Científico do Instituto Nacional de Saúde